Generating emotional speech with a concatenative synthesizer

نویسندگان

  • Erhard Rank
  • Hannes Pirker
چکیده

We describe the attempt to synthesize emotional speech with a concatenative speech synthesizer using a parameter space covering not only f0, duration and amplitude, but also voice quality parameters, spectral energy distribution, harmonics-to-noise ratio, and articulatory precision. The application of these extended parameter set offers the possibility to combine the high segmental quality of concatenative synthesis with a wider range of control settings needed for the synthesis of natural affected speech.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A concatenative Mandarin TTS system without prosody model and prosody modification

This paper proposes a two-step solution for generating natural prosody in TTS, in which no prosody prediction and modification are needed. A large phonetically and prosodically enriched speech corpus has been collected as the unit pool for the synthesizer. A multi-tier non-uniform unit selection scheme is developed to pick up the most suitable segments for concatenation from the unit pool. Fina...

متن کامل

Expressive speech synthesis using a concatenative synthesizer

1 This paper describes an experiment in synthesizing four emotional states anger, happiness, sadness and neutral – using a concatenative speech synthesizer. To achieve this, five emotionally (i.e., semantically) unbiased target sentences were prepared. Then, separate speech inventories, comprising the target diphones for each of the above emotions, were recorded. Using the 16 different combinat...

متن کامل

Exploiting improved parameter smoothing within a hybrid concatenative/LPC speech synthesizer

We depict the interpolation strategies for the concatenation of inventory demisyllables in our hybrid concatenative/LPC speech synthesizer. Inventory elements for vowels and nasals are cut in the steady state of the phoneme. Concatenating elements in the synthesis stage requires smoothing of spectral content and energy to avoid annoying discontinuities in these parameters, which is of vital imp...

متن کامل

A Flexible, Scalable Finite-state Transducer Architecture for Corpus-based Concatenative Speech Synthesis1

In this paper we describe our work involving the conversion of our phonologically-based synthesizer into a finite-state transducer (FST) representation which can be used for real-time natural-sounding synthesis. We have designed a transducer structure to efficiently perform the common task of unit selection in concatenative speech synthesis. By encapsulating domainindependent concatenative synt...

متن کامل

Verification of Acoustical Correlates of Emotional Speech using Formant-Synthesis

This paper explores the perceptual relevance of acoustical correlates of emotional speech by means of speech synthesis. Besides, the research aims at the development of »emotionrules« which enable an optimized speech synthesis system to generate emotional speech. Two investigations using this synthesizer are described: 1) the systematic variation of selected acoustical features to gain a prelim...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998